Intro to Modern Mini-LLMs
PolyU COMP5511 Lab 9 | 2026-03-16

What is a Language Model?

A language model reads some text and predicts what comes next. Unlike an image classifier, which outputs a class label (e.g., "Cat" or "Dog"), a language model assigns a probability to each possible next piece of text (a token), and the next token is chosen from those probabilities.

The Core Concept
A language model maps a sequence of input tokens to a prediction of the next token:
  • Input Prompt: The capital of France is
  • Likely Next Token: Paris
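
A minimal sketch of this single prediction step, assuming the model's output is a table of next-token probabilities. The probability values and the helper name most_likely_token are invented for illustration and do not come from any real model:

```python
# Hypothetical next-token probabilities for the prompt
# "The capital of France is". The numbers are made up for illustration.
next_token_probs = {
    "Paris": 0.92,
    "Lyon": 0.03,
    "a": 0.03,
    "located": 0.02,
}


def most_likely_token(probs):
    """Return the token with the highest probability (a greedy choice)."""
    return max(probs, key=probs.get)


prompt = "The capital of France is"
prediction = most_likely_token(next_token_probs)
print(prompt, prediction)
```

Picking the single highest-probability token is only one possible choice; a real system may instead sample from the probabilities.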

If we repeat this prediction step many times (take the newly predicted token, append it to the input, and predict again), the model can produce a whole sentence or paragraph.
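
The loop described above can be sketched as follows. This is a toy under stated assumptions: TOY_MODEL is a hypothetical lookup table standing in for a trained model, it looks only at the last token (a real model uses the whole sequence), and the "<end>" marker and function names are made up for illustration:

```python
# Hypothetical lookup table standing in for a trained model:
# it maps the last token to one "most likely" next token.
TOY_MODEL = {
    "The": "capital",
    "capital": "of",
    "of": "France",
    "France": "is",
    "is": "Paris",
    "Paris": "<end>",
}


def predict_next(tokens):
    """Toy stand-in for a model: look only at the last token."""
    return TOY_MODEL.get(tokens[-1], "<end>")


def generate(prompt_tokens, max_new_tokens=10):
    """Repeat: predict the next token, append it, predict again."""
    tokens = list(prompt_tokens)
    for _ in range(max_new_tokens):
        next_token = predict_next(tokens)
        if next_token == "<end>":  # stop when the toy model signals the end
            break
        tokens.append(next_token)
    return tokens


print(" ".join(generate(["The", "capital"])))
```

The max_new_tokens limit is a design choice in this sketch: it guarantees the loop stops even if the end marker is never predicted.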

This simple iterative process, known as autoregressive generation, is the foundational idea behind modern conversational chatbots and text generators, from miniature models to massive industry LLMs.

[Figure: the text prompt 'The capital of France is' enters a block labeled 'Language Model', and an output arrow points to the word 'Paris', highlighted as the predicted next token.]